Rank in Wordlist | Frequency | Word |
---|---|---|
3971 | 43 | 1,000 |
4046 | 42 | 2,000 |
4939 | 32 | 10,000 |
5784 | 26 | 5,000 |
5964 | 25 | 100,000 |
6582 | 22 | 30,000 |
7079 | 20 | 20,000 |
7358 | 19 | 3,000 |
7360 | 19 | 40,000 |
7655 | 18 | 1,500 |
Rank in Wordlist | Frequency | Word |
---|---|---|
10509 | 12 | c(h)uid |
28862 | 3 | c(h)ostas |
28863 | 3 | c(h)éile |
28864 | 3 | c(h)ónaí |
37885 | 2 | T(h)aoiseach |
39431 | 2 | c(h)inneadh |
39432 | 2 | c(h)omhair |
39433 | 2 | c(h)orp |
39434 | 2 | c(h)umas |
41472 | 2 | f(h)inné |
Rank in Wordlist | Frequency | Word |
---|---|---|
10509 | 12 | c(h)uid |
21501 | 4 | 0)1 |
25626 | 3 | 0)86 |
28862 | 3 | c(h)ostas |
28863 | 3 | c(h)éile |
28864 | 3 | c(h)ónaí |
32501 | 2 | 0)28 |
32502 | 2 | 0)53 |
32503 | 2 | 0)66 |
32504 | 2 | 0)7759 |
Rank in Wordlist | Frequency | Word |
---|---|---|
4303 | 39 | 50% |
4836 | 33 | 20% |
5783 | 26 | 25% |
6583 | 22 | 40% |
6584 | 22 | 5% |
7072 | 20 | 10% |
8334 | 16 | 80% |
8335 | 16 | 90% |
9185 | 14 | 60% |
10242 | 12 | 1% |
Rank in Wordlist | Frequency | Word |
---|---|---|
21704 | 4 | B&Q |
33720 | 2 | B&B |
53545 | 1 | 3&4 |
57121 | 1 | B&H |
57542 | 1 | Ben&Jerry |
57996 | 1 | Black&Tan |
61244 | 1 | D&G |
62497 | 1 | E&P |
62941 | 1 | F&F |
63241 | 1 | Fhast&Furious |
Rank in Wordlist | Frequency | Word |
---|---|---|
21499 | 4 | $1 |
21500 | 4 | $100 |
25624 | 3 | $10 |
25625 | 3 | $15 |
32491 | 2 | $1.65 |
32492 | 2 | $100,000 |
32493 | 2 | $12 |
32494 | 2 | $2 |
32495 | 2 | $250,000 |
32496 | 2 | $3 |
Rank in Wordlist | Frequency | Word |
---|---|---|
1296 | 159 | b'fhéidir |
1637 | 123 | d'fhéadfadh |
1676 | 120 | B'fhéidir |
1784 | 113 | d'aois |
2308 | 83 | D'fhéadfadh |
2552 | 75 | d'éirigh |
3017 | 61 | d'fhostóir |
3668 | 48 | de'n |
3798 | 46 | d'aon |
3939 | 44 | d'fhág |
Rank in Wordlist | Frequency | Word |
---|---|---|
3827 | 46 | sé/sí |
5219 | 30 | SEO/AN |
5227 | 30 | aige/aici |
5831 | 26 | agus/nó |
7164 | 20 | air/uirthi |
7650 | 19 | é/í |
9027 | 15 | gné-ailt/tuairim |
9186 | 14 | 9/11 |
10614 | 12 | dó/di |
11504 | 11 | leis/léi |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots